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1 Introduction 



The Fibonacci substitution yQ, page 51 caught our interest because it defines a 
numeration system and we wondered if there are other substitutions having the 
same property. We present a class of substitutions which generate numeration 
systems. For more information about Automata and formal language theory, 
see |2] and for more on numeration systems see [3]. 

2 Definitions 

First we shall give a couple of definitions which we will use in this document. 

Definition 2.0.1 (Finite automaton). A finite automaton A = {S, A, S, I, F, Y, ip} 

is a tuple in which: 

• S is the finite set of states. 

• A is the finite set of labels. 

• <5CiSxAx<Sis the collection of transitions. 

• I C S is the collection of initial states. 

• F C S is the collection of final states. 

• Y the output alphabet. 

• (p is a function from S to Y named the output function or exit map. 

We represent an automaton by a directed graph with a set of vertices S called 
states, a set of edges S called transitions and specially marked subsets of states 
/ and F, the initial and final states. 

All through this document we shall take I — {i} as the only initial state and 
F = S as the collection of final states and we shall take A C N unless stated 
otherwise. 

Furthermore we shall usually take Y — S and ip = Id as the output alphabet 
and the exit map. 

Note that the output function applies to the states and not to the labels. 

Definition 2.0.2 (Regular language). The language C(A) of a finite automa- 
ton is called a regular language. The language of an automaton is the collection 
of strings that are accepted by the automaton (all paths in the automaton that 
lead from the initial state to a final state). 

Definition 2.0.3 (Regular expression). Let A be an alphabet. Then a reg- 
ular expression E over A is defined recursively as one of the following types: 

• 0. 

• e. 
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• a, where a € A. 

• {Ei U -B2), where -Ei and -E2 are regular expressions and the U operator 
denotes a union. 

• (Ex ■ E2), where E\ and E2 are regular expressions and the ■ operator 
denotes concatenation. 

Apart from the types in l2.0.3l we shall use some other notations. 

• E* , where E is a regular expression and the * operator denotes the union 
of all powers of E, so (E* = U n< znE n ). 

• E + is an abbreviation for E ■ E*. 

We usually omit the • in a regular expression. The U is sometimes written as 
'+' or 

Note that a regular expression must be of finite length. Otherwise we call it an 
infinite automaton. 

Definition 2.0.4 (Substitution). A substitution a is a function from an al- 
phabet ^4 to ^l* — {e} of nonempty finite words on A. It extends to a substitution 
on A* by concatenation. So a(ww') = a(w)a(w'). We set cr(e) = e, with e being 
the empty word. 

Definition 2.0.5 (ri-word). An n-word of a substitution a is the unique word 
<7™(i), l 6 A being the initial letter. 

Sometimes we need to refer to an element in an n-word. We use the notation 
<T n (b)i when we want to refer to the z-th element of o- n (i). 

Definition 2.0.6 (Fixed point). A fixed point of a substitution a is an infinite 
sequence u with o~(u) = u. 

Definition 2.0.7 (fc max ). Let AcN be the collection of labels. Define fc max = 
|A|-1. 

Equivalently, let a be a substitution. Define fc max as the greatest length of the 
images in a subtracted by 1. 

Definition 2.0.8 (Numeration system). In general a numeration system is 
a strictly increasing sequence U = (C/i)i G N such that 

• Uq = 1 (to represent all n € N), 

• sup^jr^- < 00 (to have a finite alphabet of digits). 

An expansion of an integer n £ N in such a numeration system is a finite se- 
quence (ai)k>i>o such that n = 53j=fc a iUi- We write this expansion as Ofc . . . ao, 
the most significant bit is in the first position. Note that there are more than 
one expansions in general, but one of them is called the normal or greedy rep- 
resentation. We shall discuss this in Section l5"Tl 
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It is quite natural to express the expansion of Ui as 1 00 . . . . It is also common 

i 

practice to give the following restriction < di < [sup U ^ +1 ] ■ 
In general more than one expansion can be found for an integer. 

Definition 2.0.9 (Full numeration system). A full numeration system is a 
numeration system that has the extra property: 

• If A = a,kcik-i ■ ■ ■ ao and B = bkbk-i ■ ■ ■ bo, with a; + bi < Ui for all i, then 
the sum A + B equals (ctfc + bk){ak-i + bk-i) ■ ■ ■ (ao + bo). 

For more on numeration systems, see chapter 7 and in particular Section 
7.3. 

3 Equivalence between substitutions and automata 
3.1 Substitutions in general 

A substitution a on an alphabet A defines an automaton in the following way: 

• Let S = A be the collection of states. 

• Add a transition from state a to state b (a,b 6 S) labeled i if b occurs in 
a (a) at position (i + 1). 

• Let l be the initial state. 

• Let all states in S be final states. 

Every automaton in turn defines a regular language (by definition) . 

Lemma 3.1.1 (Automata and substitutions). Let a be a substitution which 
is in bijection with an automaton A, let C(A) be the language of the automaton 
and let i > 0. Then a n (i)i is the state the automaton will be in after it is fed 
with the i-th word of length n of its (lexicographical ordened) input language. 

Proof. If we write the automaton as a computation tree, the equivalence is easier 
to see. We find cr"(t) by reading all states at depth n in the tree. Let us assume 
that all words in C(A) of length n are in order. Now we look at words of length 
n + 1. Every word of length n + 1 comes from a word of length n. At the i-th. 
word of length n, the automaton is in state a n {i)i. The words of length n + 1 
that are possible by extending the word of length n are exactly those which are 
extended with the elements from a n (i)i. These are added in order. ■ 

Example 3.1.2. In Figure 13.1.11 we see an automaton A in bijection with 
substitution a. 
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Figure 3.1.2: A computation tree in bijection with a 

The language accepted by both automata is 1* + (1*0(1 + 00)* (0 + e)). The first 
few elements of this language are: 

C(A)={ e, 0,1,00,01,10,11,000,010,011,100,101,110,111,...} 

12 3 

If for example we feed all strings of length 3 of its input language, we shall end 
respectively in the states b, c, b, c, b, b, a, which is exactly the string a 3 (a). 

3.2 Substitutions with fixed points 

Of course Lemma f3. 1.11 is also valid for substitutions with a fixed point. If we 
have a fixed point, the resulting automaton will accept leading zeroes in its input 
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language; leading zeroes do not change the state of the automaton, so 0*w ends 
in the same state as w. This is one of the conditions to make a numeration 
system. From now on we shall only look at substitutions with a fixed point. We 
shall also ignore words in C{A) with leading zeroes. 

Corollary 3.2.1 (Automata and substitutions with a fixed point). Let 

a be a substitution which is in bijection with an automaton A, let u be the fixed 
point of the substitution and let C(A) be the language of the automaton. Then 
Ui is the state the automaton will be in after it is fed with the i-th word of its 
input language. 

Proof. This follows directly from Lemma 13.1.11 H 



G0- 

Figure 3.2.3: An automaton with a fixed point 

Example 3.2.2. The fixed point of a shown in Figures \'A . 2 . 31 and 13 . 2 . 41 with a 
as the initial letter is: 

u = abcbbcbcbbcbbcbcbbcbcbbcbbcbcbbcbbcbcbbcbcbbcbbcbcbbc. . . 

and 

C(A) = {e, 0, 1, 10, 11, 100, 110, 111, 1000, 1001, 1100, 1110, 1111, 10000, . . .} 
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a- 




1 1 




= abcb 



1 1 1 




= abcbbcb 



Figure 3.2.4: A computation tree with a fixed point 



4 Numeration-automatism 

Although Corollary 13.2.11 is valid for all automata that are in bijection with 
a substitution which have a fixed point (that means a fairly large group of 
automata), it is not as useful as it might seem at first glance. The problem is 
that in general we can not give the n-th word of a language C a priory. 
However, there is an obvious class of automata for which we can give the n-th 
word, the so-called fc-automata. In this class each letter has a substitution word 
of length k. For this class of automata the n-th word of its input language is 
the A;-base expansion of n. 

There are also some non-fc-automata for which we can describe the n-th word 
without much calculation. This class has the property that we can define a 
numeration system in which the expansion of n is the n-th word of C(A). We 
shall refer to these substitutions as numeration-automatic substitutions. In gen- 
eral however, a substitution is neither k- nor numeration-automatic. Sometimes 
there even exist numbers n for which we can not find a valid expansion. 

4.1 The Fibonacci substitution 

The Fibonacci substitution and its automaton can be seen in Figure 14.1.51 




it 

(0 + 10)*(e + l) 

Figure 4.1.5: The Fibonacci automaton 



If we take a as the initial letter the substitution gives the following fixed point: 
u = abaababaabaababaababaabaababaabaababaababaabaababaaba. . . 
The first elements of the language the automaton defines are: 
C(A) = {e, 0, 1, 10, 100, 101, 1000, 1001, 1010, 10000, 10001, 10010, 10100, . . .} 

This language has no consecutive ones. We will call the (n + 2)-th word in this 
sequence the Zeckendorf expansion of an integer n. 

Definition 4.1.1 (The Fibonacci sequence). Let (Fi)jgN be the sequence of 
integers defined by Fq — 1, F\ — 2 and for any integer i > 1, -Fj+i = Fi-i + Fi- 

Definition 4.1.2 (The Zeckendorf expansion). If n = J2i=o a i^i with au = 
1, cii 6 {0, 1} and V(i < k){a,i(ii + i — 0}, we say that Zeck(n) = a^au-i ■ ■ ■ ao G 
{0,l} fc+1 is the Zeckendorf expansion of the integer n. 

The Zeckendorf algorithm is actually an instance of the greedy, or Euclidean 
algorithm. We shall discuss this in more detail in Section l5~Tl 
If we write the partitions of N as F a and 

F a = {n e N,Zeck(n) G {0,1}*0} 

F b = {n e N,Zeck(n) G {0,1}*1} 

Then 

F Q = {0,2,3,5,7,8,10,11,13,15,...} 

¥ b = {1,4,6,9,12,14,17,19,22,25,...}. 

Hence we get an a at position n if n ends with a in the Zeckendorf expansion, 
we get an b otherwise. 

So if we want to calculate the n-th word of the language this automaton defines, 
we only have to expand n with the Zeckendorf algorithm. 
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4.2 Fibonacci's 'brother' 

In Figure: [4.2.61 we see another substitution that is numeration-automatic. 



a — » a6 
6^6 



0*(e + 10*) 



Figure 4.2.6: A Fibonacci-like automaton 



Example 4.2.1. The numeration system for this substitution is based on the 
sequence (n) n ^+. So the first few expansions are as follows: 

-> 

1 -> 1 

2 -> 10 

3 -> 100 

4 -> 1000 



5 A generalization 

Can we find a more general class of automata that have similar properties? We 
will show that the answer is yes for a (possibly small) class of substitutions. 
This class can be found by calculating the sequence on which the expansion is 
based from the substitution itself. If we look at the Fibonacci substitution, we 
see that for each word a n : 

\o n \a — \o n -\\a + |Cn-l|h 
Wn\b = l^n-ila 



Or in matrix form: 

F = 

If we now define the initial matrix as 



1 1 
1 



1 


and multiply it from the left repeatedly with the (2 x 2) matrix F, we obtain: 
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and if we add the elements of the matrices we obtain the Fibonacci sequence 
(note that the elements of the vectors also form the Fibonacci sequence). 

The matrix for the 'brother' of the Fibonacci sequence is 



By using this method we find a numeration system for any automaton. And by 
using a greedy generalized Zeckendorf expansion we can cover all fc-automata, 
the Fibonacci automaton and its 'brother'. 

Lemma 5.0.2 (Substitutions and numeration systems). If a substitu- 
tion a has a fixed point, then the sequence t|, |cr(t)|, er 2 (t)|, ... is a numeration 
system. 

Proof. A sequence of integers is a numeration system when it complies to the 
restrictions of Definition 12. (J. 81 The first demand is adhered to because t is a 
letter, so \t\ = 1. The second demand is adhered to as well, because sup^^- < 
fc max and the sequence is infinite, otherwise there would be no fixed point. ■ 

Note that this proof also holds for most substitutions, as long as cr„(t) < a n+1 (i). 
5.1 The expansion algorithm 

From a substitution we can extract a numeration system, but we still need an 
expansion algorithm to generate C(A). 

Definition 5.1.1 (General Zeckendorf expansion). Let U be a full numer- 
ation system. If n = 53i=o a &i-> with < <Zj < fc max for i = 0, 1, . . . , k — 1 and 
at > 0, we say that akau-i ■ ■ ■ clq is the expansion of n in the U numeration 
system. 

This algorithm is also known as the greedy or Euclidean algorithm. Hollander 
0] has described the class of recurrent functions which describe a numeration 
system of which the Euclidean expansion is recognized by automata. This how- 
ever is not the entire class of numeration-automata. We shall discuss this class 
in Theorem 157031 

Automatic expansion Let U be a numeration system, let A be an automaton 
and let A m be the state in which the automaton will be after reading the first 
m letters of an input word. Let t(A m ) be the set of outgoing transitions of state 
A m . Assume that t(A m ) — {0, 1, ... , |t(A m )| — 1}. If we can write any integer 
n > as n = J"V_ aiUi with a$ £ t(Ai) for i = 0, 1, . . . , k — 1 and > in 
a unique way such that the automaton accepts the expansion, then Auto(n) = 
akak-i ■ ■ ■ clq is said to be the automatic expansion of the integer n. 




which applied to the initial matrix yields the sequence 
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Since we are only interested in a mapping from N + to the regular language the 
automaton accepts, it makes sense to look at it this way: does the automatic 
expansion of an integer represent the integer itself? 

We have devised an algorithm that should answer that question for any substi- 
tution: 

Definition 5.1.2 (Automatic expansion). The automatic expansion is an 
extension of the standard greedy algorithm. 

1. Given n and the numeration system (f/j)ieN- Let Ui be the largest element 
in U such that n < Ui(\t(A )\ - 1). Let m = 1. 

2. Let Oj be the largest element in t(A m ) such that <2j[/j < n 

3. Replace n by n — atUi, i with i — 1, m with m + 1 and repeat step|21until 
n = 0. 

If this algorithm fails, the number n could not be expanded. 




(0 + 1 + (202*(0 + 1)))*202* + 2 + e 

Figure 5.1.7: A numeration-automaton 

Example 5.1.3. The substitution shown in Figure 15. 1 . 71 defines the following 
numeration system: 

{1, 3, 7, 17, 43, 109, 275, 693, 1747, 4405, 11107, . . .} 

Suppose we want to expand the decimal number 41. 

Ui = 17, this makes i = 3. The expansion goes as follows: 

The first element is 2 because 2 • 17 < 41 < 3 • 17, this leaves 41 - (2 • 17) = 7 
to be expanded. Go to state b (follow label 2). 

The second element is because this is the only transition going out of state b. 
Go to state c. 

The third element is 2 because 2 ■ Ui — 2 • 3 < 7, this leaves 7 — (2 • 3) = 1 to be 
expanded. Go to state c (follow label 2). 

The last element is 1. We conclude that this automaton can expand the number 
41 correctly with the given algorithm. 

Observe that the generalized Zeckendorf expansion does not work in this case. 
The greedy algorithm would have expanded the number as 2100, but this is not 
accepted by the automaton. 
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6 Survey 



Now we look at which substitutions are numeration-automatic. We have already 
seen that the Fibonacci automaton and the fc-automata have this property. 

The extended Fibonacci automata Figure 16*. (J . 81 gives another example of 
a substitution of which we shall prove that it is numeration-automatic. 




(0 + (l(10)*0))*(e + (l(10)*(e+l))) 

Figure 6.0.8: A numeration-automaton 

Example 6.0.4. 

From Figure If). 0.81 on we will omit the regular expressions because it is tedious 
work and because the automaton gives a more insightful picture of the language 
than the regular expression. 

Just like the Fibonacci automaton, this one has a couple of 'brothers and sisters', 
the most important of which are shown in Figures 16.0.91 16.0.101 and 16.0.111 




Figure 6.0.9: A numeration-automaton 




Figure 6.0.10: A numeration-automaton 
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Figure 6.0.11: A numeration-automaton 



We can keep adding states as shown in Figure l6~0. 121 




Figure 6.0.12: A numeration-automaton 



We can further increase the number of transitions. See e.g. Figure 




Figure 6.0.13: A numeration-automaton 



And here we stumble upon a class of automata which have incidence matrices 
of the form 







2=0,1 


2=0,2 


2=0,3 


■ ■ ■ X ,n 




1 











■ ■ ■ 2=l,n 







1 








• • • X2,n 










1 





■ ■ ■ 2=3,n 


V 













1 *£n,n 



Here xo.o must be larger or equal to 1 and 2=0,1,2:0,1 . . . xo, n and xi <ni xi, n ■ ■ ■ 2= 
may be any value between and k max . 
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Figure 6.0.14: A non-numeration-automaton 



Having an incidence matrix of this form is not sufficient, because the substitution 
in Figure 16.0. 141 has such a matrix, but is not numeration-automatic. 
The expansion sequence is as follows: 

{1, 2, 4, 7, 13, 24, 44, 81, 149, 274, 504, 927, . . .} 

The automaton crashes when we try to expand the number 5. 

Theorem 6.0.5 (co-automatism). Let <jo,...,a m be a substitution and let 
< i,j < to. // 

oo — > o'o +0 '* 

and 

then the associated automaton is numeration- automatic. 

Proof. The numeration system (£7i)ieN of a is given by t/j = |<7*(t)|. Consider a 
level i of the infinite tree associated with a, and number the nodes from to 
|cr (t)| = U(. Consider a string a^_i . . . aido leading to state x with number n 
at level i. 

Call the states that the path will encounter: xi, ...,x%,x. Each node Xi has 
siblings to the left and all these siblings are labeled by l. In the % remaining 
steps these t's generate |cr* (t) | symbols to the left of x. The total number is 

i-i t-x 

j=0 (=0 

Thus the representation of n by the automaton is ai-i . . . ao too. ■ 

7 Combining automata 

The automata described above are 'basic' automata, which means that the au- 
tomata define their own numeration system and that the automata are minimal. 
With minimal we mean that there is no automaton that defines the same nu- 
meration system, but has less states. We can use these automata as a basis for 
other automata. 
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The product automaton Let A and B be automata, let i be the initial state 
of an automaton. Let Sa be the states of A and Sb the states of B. We define 
a superstate as a state consisting of a tuple (a, b) with a G 5^ and 6 G iSg. We 
make the product automaton as follows. 

• Start in the superstate (t, l) and make this state the initial state. 

• If both A and B have a transition from t labeled i, make a new superstate 
(a, 6) (the endpoints of the transitions in both automata) and make a new 
transition from (t, l) to (a, b) labeled i. 

• Do the same for all other states. 



:i3 



x 




(6,a)| ^-l— [(M) 




(a, a) 



(6,6) 




Figure 7.0.15: Combined automata 



Example 7.0.6. In Figure 17. 0.1 51 we have combined the Fibonacci automaton 
with the Prouhet-Thue-Morse automaton by applying the product construction 
to the two automata. 

The result is an automaton that generates the Prouhet-Thue-Morse sequence in 
the Fibonacci numeration system. By using the following exit map (projection 
on the second coordinate): 



a 
b 
b 
a 



We obtain the Prouhet-Thue-Morse sequence again, and by using this exit map 
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(projection on the first coordinate): 



a 
b 
c 
d 



a 
b 
a 
b 



We get the Fibonacci sequence again. 



Combining automata can result in a substitution for which it is not directly 
clear that it is numeration-automatic. 

Example 7.0.7. When we combine the substitutions 



a - 


-> ab 


b - 


-> ac 


c 


■+ cc 



with 



ab 

c 

ac 



we obtain (after renaming a := (a,a),b := (b,b),c := (a,c),d := (b,c),e := 
(c,c),/ := (c,a),g:= (c,6)) 



&AxB '■ < 



a 
b 
c 
d 
e 
f 
9 



ab 

c 

ad 
ae 
fe 

fg 

e 



and the induced incidence matrix is not of the previously defined form 



/ 1 





1 


1 








\ 


1 























1 























1 























1 


1 





1 














1 


1 





V o 














1 


o J 



Still this substitution is numeration-automatic, because this automaton has ex- 
actly the same behavior as its 'parents'. 

Theorem 7.0.8 (Product of co-automata). The product automaton o/cto- 
automata is a q- automatic. 
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Proof. Every state x has \t(x)\ outgoing transitions, the first \t{x)\ — 1 of them 
are pointing to the initial state. We determine the product of two states x 
and y. Assume that \t(x)\ < \t(y)\, this will result in a node with the first 
\t(x)\ — 1 transitions pointing to (t, t). Therefore the resulting automaton is 
(To-automatic. I 

Theorem 7.0.9 (Product of <jq- and fc-automata). The product automaton 
of a ao-automaton and a k-automaton is numeration-automatic. 

Proof. If the fc max of the <7o-automaton is larger than the fc of the fc-automaton, 
then the fc max of the product automaton will be k. Thus, when we leave out all 
transitions higher than k in the original co-automaton, the resulting product 
automaton will be the same. Therefore we may assume that fc max < fc. 
Consider the computation tree of the ao-automaton. When we apply the product 
construction to this tree, the structure of the tree does not change but the states 
are re-labeled. The first coordinate indicate the original state. Thus Theorem 
16. 0.51 still applies. The fixed point of the substitution is still computed correctly 
by the automaton because of Lemma 13.1.11 I 

The product automaton of two fc-automata is fc-automatic. If for example we 
construct the product automaton of a 2- and a 3-automaton, the result will be 
a 2-automaton. 



8 Reverse reading 

We know from the theory of fc-automata [I], page 15 that if a fc-automaton in 
direct reading exists, there also exists a fc-automaton in reverse reading that 
accepts the same input language and gives the same mapping to the output 
alphabet. The only difference is that the automaton in reverse reading reads 
the elements of its input from right to left instead of the normal order. The 
proof of this relies upon the existence of a fc-kernel. 

In general we can not make a fc-kernel. However, we can construct an automa- 
ton in reverse reading from an automaton in direct reading without having to 
construct a kernel. 

We know this is possible because the theory of formal languages 0, page 419 
states that a regular language is closed under the operation of mirroring, but this 
theory does not give an algorithm to make such an automaton. This is because 
the theory of formal languages does not apply to fc-automata and numeration- 
automata directly. For example, the Prouhet-Thue-Morse automaton should be 
reduced to an automaton with one state and two loops with labels and 1 
according to this theory, because this is the minimal automaton that accepts 
{0, 1}*. Moreover, all 2-automata should be reduced to this automaton. 

Reversing an automaton To reverse an automaton, we only have to make 
all final states initial states and vice versa and we must change the direction 
of the transitions. By doing this, we probably end up with a non-deterministic 
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automaton. Fortunately, the non-deterministic automaton can be converted to a 
deterministic one using the subset construction 0, page 118. This construction 
does not take into account that the set of final states may have partitions. Since 
our automata have output in their final states, the output induces a partition of 
the set of final states. General automata do not have an output in their states, 
a state is simply a final state or not. 

Analogous to the subset construction we make our automaton in reverse reading, 
but we take into account the possibility of partitions by including the output 
function in the state. 

Example 8.0.10. We have an automaton shown in Figure 15.0.161 Since the 
automata we are going to make do not necessarily consist of final states only, 
we shall mark the final states in the following automata with an inner circle. 




Figure 8.0.16: A deterministic automaton in direct reading 

• First we swap final- and initial states and we change the direction of the 
transitions as shown in Figure 15. 0.1 71 




Figure 8.0.17: A non-deterministic automaton in reverse reading 

• Now we shall apply the subset construction [2], page 120 on each of the 
initial states as if we were dealing with three automata. We shall denote 
the state of the three automata in one state, so if the automata are in states 
{a}, {b}, {c} respectively, we shall notate this as {{a}, {&}, {c}}, because a 
is the initial state of the first automaton, b the initial state of the second 
and c is the initial state of the third automaton. 




Figure 8.0.18: Automaton 1 
19 




Figure 8.0.19: Automaton 2 




Figure 8.0.20: Automaton 3 

Reading a in state {{a}, {&}, {c}} will result in state {{a}, 0, {b, c}}, 
because automaton 1 stays in state a when reading a 0, automaton 2 
crashes when reading a in state b and automaton 3 goes to state {b, c} 
because there are two outgoing branches labeled in state c. 

We also have to determine the output function of each state. This is done 
by observing in which coordinate the initial symbol (in this case a) of 
the original automaton is. For example the state {{a}, 0, {&, c}} will have 
a as output and the state {0, 0, {a, b, c}} will have c as output. This is 
because when we have reached the initial state in reverse reading, we have 
also reached a final state in direct reading. For clarity, we write the output 
preceded by a slash. In the first instance we write {{a}, {&}, {c}/a} If no 
output can be found, we use e as output. This means that the state in 
question is not a final state. 

When we start in state {{a}, {&}, {c}/a} and follow the labels and 1, 
we get two new states. We now apply the same construction to these new 
states. The result is shown in Figure IS. 0.211 
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0, 0, {a, 6, c}/c 




Figure 8.0.21: A deterministic automaton in reverse reading 



If we simplify the nodes to standard notation, we get the automaton 
shown in Figure 151) .2 21 
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Figure 8.0.22: A simplified deterministic automaton in reverse reading 

9 Numeration systems 

The class of numeration-automatic substitutions has an interesting subclass: the 
subclass of substitutions that define a full numeration system. We believe that 
this is the class of cr - au tomata with the restriction that the cardinality of the 

images of the substitution do not increase, so |tr | > |oi| > In this section 

we show that the condition suffices. 

Without loss of generality, we can write a er - s ubstitution in the following way 

cr — ► Co a\ 

CTfe-l — ► CToVfc 
Cfc — * O"o*cr* 

We assume this special form throughout this section. 
First we derive a lemma for general co-automata. 

Lemma 9.0.11 (The recurrent function of a substitution.). If a is a 

substitution of a - automatic type with k substitution rules, then the numeration 
system is a linear recurrent function of at most order k. 

Proof. Since each substitution rule is of the form 

(Ti 0#CT i+ l, ji > 
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and the first rule is of the form 

(T (jjVi, j > 0, 

we can extract part of the recurrent function from the first substitution rule. 
This results in a relation depending on do and a\: a n = joa n -i + b n -\- Hence 
b n -i can be expressed as a n — joa n -i, analogously we can write the next equation 
as b n = jia n -i +c„_i. This yields c„_i = b n - jia„_i = a n+1 - j a n - jia„_i. 
In this way we can successively write 6„_i, c n _i, d n -\, ... as linear combinations 
of a ni a n+1 , a„ +2 , .... Finally we get a linear homogeneous recurrence relation 
with constant coefficients of the numbers a n . The order of this recurrence equals 
the number of substitution rules. ■ 

Example 9.0.12. Consider the following substitution scheme. 

a — > ab 

b — ► aac 

a '■ \ j 

c — > a 

d — > ac 

v 

First we write the substitution rules as recurrent functions. 

On = ttti-l + &n-l 

&„ = 2a n _i + c„_i 

c n — d n —i 
da ~t~ Cn— 1 

Now we start eliminating 

b n -i 
c n -i 

d n -l 

So 

fl n+3 — «ri+2 — 2a„ + i = fl tl _i + a Il+ i — a„ — 2a„_i 

a n+3 = «n+2 + 3a„ + i — a„ — a„_i 

And the final result is: a n = a n _i + 3a„_ 2 — a„_ 3 — a„_4, which is the recur- 
rence that generates the sequence 1, 2, 5, 10, 22, 45, 96, 199, 420, 876, . . . (with the 
appropriate initial conditions \a\, \cr(a)\, er 2 (a)|, |cr 3 (a) | ) . 

Theorem 9.0.13 (Full numeration systems and numeration-automatism). 

If a is a a substitution, with |ct(cto)| > |er(cri)| > ... > \a(ak)\, then the asso- 
ciated automaton generates a full numeration system. 



d n a n ^\ 
= b n — 2a n -i = a n+ i — a n — 2a„_i 
= c n — a n+ 2 — a n +i ~ 2a„ 
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Proof. Consider the computation tree associated with the substitution a. Choose 
n such that |cr n (cr )| < x < \a n+1 (a a )\- 

Let a be a state with <r(a) = o~ "a' , with a' £ {e, <7i, . . . , afc}. 
The state a results in |er m+1 (a)| states m levels deeper in the tree. Because 
\a m (ao)\ = U m , these states can be partitioned in £ a sets of U m states and one 
set of |tr(a')| states with \a m (a')\ < U m because a (a') < ct(cto). 
We now iterate the following procedure starting with the triple (x, ao,n). 
Consider the triple (x,a,m) with x an integer such that x < U m +i and a a 
state. We write x = £ m U m + x' with < x' < U m . Then l m < < fc max + 1, 

hence £ m < A: max . If £ m < £ a , then x is the result of state <7o when we go m 
levels higher. If £ m = £ a , then x is the result of state a' when we go m levels 
higher. In the former case, we replace the triple (x,a,m) by (cc',cro,m — 1), in 
the latter case by (x',a',m — 1). Recall that if a' = Cj, a = o~j, then i > j and 
therefore |c(ci)| < |cr(crj)|. 

Obviously x — £ n U n + £ n -iU n -i + . . . + £oUo- However, by construction we 
have that £ n £ n -i . . . £q is the expansion of x in the computation tree. Hence the 
numeration system is full. I 

We remark that the extra condition can not be dropped. See Example 15. 1.31 

10 Conclusion 

We have seen that with the automatic expansion we can successfully find numer- 
ation systems for a class of automata. The most important findings are that the 
substitution defines a numeration system and that the automaton defines an ex- 
pansion algorithm. Combine them and we get the class of numeration-automatic 
sequences. 

The sequences which are not numeration-automatic remain interesting, because 
the associated automaton calculates most of the letters in the fixed point cor- 
rectly, but it leaves gaps. Maybe it is somehow possible to 'repair' the automaton 
to correct this behavior, for example by using a stack automaton, but this is 
beyond the scope of this document. 

11 More on numeration-automatism 

The following website has a program that checks a substitution for numeration- 
automatism. 

\protect\vrule widthOpt\protect\href {http : //www. liacs .nl\string~ jlaros/semi/Mhttp : //www. lii 

The "On-Line Encyclopedia of Integer Sequences" is a huge database of integer 
sequences. The author has contributed some sequences and commented on some 
other sequences. See 

\protect\vrule widthOpt\protect\href {http : //www. research. att . com\string~njas/ sequences/Seis 
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for the main page, or one of the following pages. 

\protect\vrule widthOpt\protect\href {http : //www . research . att . com/cgi-bin/access . cgi/as/nj as, 
eisA . cgi?Anum=A000045 

\protect\vrule widthOpt\protect\href {http : //www. research. att . com/cgi-bin/access . cgi/as/nj as, 
eisA.cgi?Anum=A101197 

\protect\vrule widthOpt\protect\href {http : //www. research. att . com/cgi-bin/access . cgi/as/nj as, 
eisA.cgi?Amim=A101168 

\protect\vrule widthOpt\protect\href {http : //www . research . att . com/cgi-bin/access . cgi/as/nj as, 
eisA.cgi?Anum=A101169 

\protect\vrule widthOpt\protect\href {http : //www. research. att . com/cgi-bin/access . cgi/as/nj as, 
eisA . cgi?Anum=A101399 

\protect\vrule widthOpt\protect\href {http : //www. research. att . com/cgi-bin/access . cgi/as/nj as, 
eisA . cgi?Anum=A101400 
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